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ABSTRACT 

The leptonic decays of the heavy gauge bosons W and/or Z provide a clear experimental 
signature at hadron colliders. The production of accompanying jets is an excellent signal 
to probe QCD, while also being the main background to many searches for new physics. 
Describing the complex final state of W or Z + jets is a theoretical challenge with most 
existing calculations combining matrix elements for high energy jet production with a 
parton shower for lower energy jet production. We focus on two models: SHERPA, which uses 
Leading Order matrix elements for boson and jet production; and PDWHEG with HERWIG + +, 
which uses a Next-To-Leading Order Matrix element for Z production. In order to isolate 
the impact of the matrix elements for jet production, it is first essential to constrain the 
differences in the rest of the calculation in each case: specifically, the Multiple Parton 
Interaction models, and the tuning of the parton shower interfaced to the matrix elements. 
We test all three aspects of these models against data from the Tevatron, and perform a 
study of some basic kinematic variables at the LHC energy. 



*flavia.dias@cern.ch 
^nurse@hep. ucl.ac.uk 
* gavin . hesketh@cern .ch 



1. INTRODUCTION 



The leptonic decays of the heavy gauge bosons W and/or Z accompanied by jets, W — > 
tv + jets and Z — >■ U + jets, offer search channels for new interactions of particles in 
high energy collisions. Many extensions of the Standard Model predict new particles 
with electroweak (EWK) couplings that decay into the SM gauge bosons W, Z, and 7, 
accompanied by jets. Any production of new heavy particles with quantum numbers 
conserved by the strong interaction and EWK couplings is likely to contribute to signatures 
with one or more EWK gauge bosons; additional jets will always be present at some level 
from initial-state radiation, and may also be products of cascade decays of new heavy 
particles. W + N jets and ti + jets are backgrounds to most supersymmetry searches in 
final states with leptons, jets and missing transverse energy. The Z(—> vv) + jets signal 
is an irreducible background to inclusive hadronic searches of Dark Matter, based on jets 
and missing energy (MET). W or Z + jets is also a major background for Higgs searches. 

In the context of the SM, the study of the production of electroweak bosons with N jets 
allows for tests of perturbative QCD. The production cross section scales approximately 
with the strong coupling constant for each additional jet. While current theoretical pre- 
dictions at leading order (LO) and next to leading order (NLO) are in good agreement 
with data at the Tevatron, comparison at the higher energy of the LHC, and at higher jet 
multiplicities are needed. Inclusive and differential cross sections access the parameters 
of the perturbative expansion, such as the renormalization and the factorization scales, as 
well as the parton density functions (PDFs), through the and r\ distributions of the 
vector bosons and the associated jets. Predicting all these quantities and comparing them 
to the Tevatron data has already produced several improvements in the calculation and 
generation techniques, such as the introduction of generators based on the LO calculations 
of matrix-element (ME) for the associated jet production, and the definition of several 
matching procedures to the parton-shower generators. 

In this paper we make a comparative study of ME+PS Monte Carlo generators (SHERPA 
and HERWIG + +) with an NLO calculation (POWHEG), validated with Tevatron DO and CDF 
data, and generated at the LHC energy. This will serve as a preparatory MC study of the 
major SM backgrounds for the regions that are relevant to New Physics searches. 

2. MATRIX ELEMENTS CORRECTIONS / PARTON SHOWER 

The Parton Shower (PS) technique is a collinear approximation of the description of parton 
splittings in QCD radiation that accompanies hard scattering processes. 

However, although it provides a good description of low p± observables, it usually fails 
to fill the phase space (can't emit anything above the factorisation scale by construction) 
and in the hard region (close to the factorisation scale) this approximation is simply no 
good anymore. One way to improve the description of kinematical observables is to add 
a Matrix Element (ME) correction for the extra emissions. This can be implemented in 
different ways, as will be described in each generator section. 

2.1 SHERPA 

The SHERPA generator uses the improved CKKW Matrix Element - Parton Shower merging 
[1] that relies on a separation of the event phase space in two separated regions, defined 
by a choice of scale. Above the chosen scale, all radiation must be produced by the Matrix 
Elements, and below the chosen scale all radiation produced by the Parton Shower. In 
matching these two regions to provide full phase space coverage, overlaps such as a parton 
shower emission in the scale range covered by the Matrix Element, must be removed [2]. 

The basic steps in the algorithm implementation are [2]: 



• Events are generated based on the matrix elements for W/Z + k jets, where k = 0, 
... N. The Matrix Element jets are required to be above the merging scale, Qcut- 



• The event is then reconstructed back from the final state particles. Particles are com- 
bined in the most likely combinations according to the parton shower probabilities. 

• Events are then passed to the parton shower, which is allowed to generate radiation 
from any part of the process. 

• The shower is then analyzed, and any events in which parton shower produces an 
emission above the scale Q cu t are vetoed, leading to a rejection of the full event. This 
process is called Sudakov rejection. 

The SHERPA generator automatizes the generation of inclusive samples, combining ME 
for different parton multiplicities with PS and hadronization. Some parameters related to 
the ME and PS calculations have to be set accordingly. 

2.2 HERWIG + + 

The aim of the ME corrections in the PS on HERWIG + + is to correct for two deficiencies 
in the shower algorithm: to populate the uncovered region of high p± in the phase space 
(non-soft non-collinear) , and to correct the populated region, where the extrapolation away 
from the soft and collinear limits is not perfect. These corrections are called the hard and 
soft matrix element corrections respectively [3]. 

Soft Matrix Element Corrections 

The soft correction is derived by comparing the probability density that the i th resolvable 
parton is emitted into a region of the phase space in the PS approximation (quasi-collinear 
limit), and the exact ME calculation. A simple veto algorithm is then be applied to the 
parton shower to reproduce the matrix element distribution, which relies on there always 
being more parton shower emissions than matrix element emissions. This is ensured simply 
by enhancing the emission probability of the parton shower with a constant factor [3]. 

The correction is applied to the hardest emission so far in the shower, to ensure that 
the leading order expansion of the shower distribution agrees with the leading order matrix 
element, and that the hardest (i.e. furthest from the soft and collinear limits) emission 
reproduces it [4]. 

Hard Matrix Element Corrections 

The hard ME corrections aim to populate the high p± region that the PS leaves uncovered. 
This domain of the phase space should have radiation distributed according to the exact 
tree level ME for this extra emission, and as the Parton Shower does not populate this 
region, a different approach is needed to achieve this. Prior to any showering, the algorithm 
checks if the required ME is available for the hard process. Then, a point is generated in the 
appropriate region of phase space, with a probability based on a sampling of the integrand. 
The differential cross section associated with this point is then calculated and multiplied 
by a phase space volume factor, giving the event weight. The emission is accepted if the 
weight is less than a uniformly distributed random number in the [0,1] interval, and the 
momenta of the new parton configuration is processed by the shower as normal [3] . 



3. NEXT TO LEADING ORDER METHODS 



Both SHERPA and HERWIG + + described above include dominant QCD effects with leading 
order matrix elements combined with a leading logarithmic parton shower. However, higher 
order calculations are required to match the precision of current data measurements. Going 
even one step beyond the leading order is already a complex task: the initial hard process 
should be implemented in NLO; and shower development would have to be improved in 
next to leading logarithmic accuracy in collinear and soft structure. An intermediate 
step has already been developed: keeping the leading logarithmic order for the shower 
approximation, while improving the treatment of the hard emission to NLO accuracy 
(NLO+PS approach). We test one such generator: PDWHEG. 

3.1 POWHEG 

In the POWHEG (Positive Weight Hardest Emission Generator) formalism, the generation of 
the hardest emission is performed first, using full NLO accuracy, and using the HERWIG + + 
parton shower to generate subsequent radiation. The POWHEG cross section for the genera- 
tion of the hardest event has the following properties: 

• At large kx (momentum of incoming particle) it coincides with the pQCD cross 
section up to (0(a|)) terms. 

• It reproduces correctly the value of infrared safe observables at the NLO. Thus, also 
its integral around the small kx region has NLO accuracy. 

• At small kx it behaves as well as standard shower Monte Carlo generators. 

The POWHEG formula can be used as an input to any parton shower program to perform 
all subsequent (softer) showers and hadronization. However, as kx is used to define the 
matrix element scale, the parton shower must also be ordered in p±. The shower is then 
initiated with an upper limit on the scale equal to the kx of the POWHEG event, and fills 
in all radiation below that scale (a truncated shower). For the case of a virtuality or 
angular ordered shower, emissions at higher kx may be produced, and must subsequently 
be vetoed to avoid double counting with the ME emission: a vetoed truncated shower, 
which is not possible with current parton shower programs like PYTHIA and HERWIG. We 
point out, however, that the need of vetoed truncated showers is not specific to the POWHEG 
method. It also emerges naturally when interfacing standard matrix element calculations 
with parton shower. At present, there is no evidence that the effect of vetoed truncated 
showers may have any practical importance [5]. 

The POWHEG method solves the problem of negative event weights that arise in other 
NLO methods, such as in MC@NL0. It also defines how the highest p± emission may be 
modified to include the logarithmically enhanced effects of soft wide-angle radiation. In the 
POWHEG framework, positive weight events distributed with NLO accuracy can be showered 
to resume further logarithmically enhanced corrections by [3]: 

• Generating an event according to the POWHEG formula; 

• Hadronizing non-radiating events directly; 

• Mapping the radiative variables parametrizing the emission into the evolution scale, 
momentum fraction and azimuthal angle, from which the parton shower would re- 
construct identical momenta; 



• Evolve, using the original LO configuration, the leg emitting the extra radiation from 
the default initial scale, determined by the colour structure of the N-body process, 
down to the hardest emission scale such that the p± is less than that of the hardest 
emission, the radiation is angular-ordered and branchings do not change the flavour 
of the emitting parton; 

• When the evolution scale reaches the hardest emission scale, insert a branching with 
parameters into the shower; 

• From all external legs, generate p± vetoed showers. 

This procedure allows the generation of the truncated shower with only a few changes to 
the normal HERWIG + + shower algorithm [3]. 

4. COMPARISONS TO TEVATRON DATA 

The inclusion of Matrix Elements is only one component of the simulation. In order to 
describe real data, other aspects of the generators must also be accurate: the Parton 
Shower (PS), and the model of Multiple Parton Interactions (MPI). The choice of Parton 
Distribution Function (PDF) may also play a role, or at least be highly coupled to the 
tuning of these phenomenological models. Finally, there are some settings unique to the 
generators themselves, such as the choice of matching scale between the parton shower and 
matrix elements. So, in order to isolate the impact of using LO or NLO matrix elements, 
we must also constrain all the other aspects of these models. 

4.1 Selection of Events and Kinematic Cuts 

We use the most recent data from the Tevatron experiments, CDF and DO, to test the 
generator performance. The Tevatron is a proton anti-proton collider with a center of 
mass energy yfs =1.96 TeV, located at Fermilab, USA. 

For each of the analyses, Monte Carlo events are selected according to the correspond- 
ing data selection, described in their papers. But, in general, the lepton pair invariant 
mass is required to be between some mass range (around the Z mass peak) to enhance 
the contribution of pure Z exchange over 7* exchange and Z/-f* interference terms, and 
pseudorapidity cuts on the (CDF or DO) detector acceptance. 

The generators were configured to produce inclusive Z/7* particles decaying into lep- 
ton pairs (electrons or muons), with the invariant mass constraint. Also some generator 
parameters and inputs are changed in order to obtain the best description of the data: the 
intrinsic momentum of the partons in the incoming (anti)protons (K_PERP), MPI model 
parameters and the input PDFs. 

The comparison plots were made using the RIVET framework [6], version 1.2.1. The 
RIVET project (Robust Independent Validation of Experiment and Theory) allows valida- 
tion of Monte Carlo event generators. It provides a set of experimental standard validated 
analyses useful for generator validation and tuning, as well as a convenient infrastructure 
for adding user's own analyses. 

4.2 Z Transverse Momentum 

The main benefit to using Z events to probe the underlying process is that the Z can be 
fully and unambiguously reconstructed. The Z p± is generated by the momentum balance 
against initial state radiation (ISR) and the primordial/intrinsic p± of the Z's parent 
partons in the incoming hadrons. Within an event generator, this recoil may be generated 
by a hard matrix element at high p±, or by the parton shower or intrinsic momentum 
of the partons at low p±. The inclusive Z p± is therefore an excellent first test for the 
generators, before looking at more exclusive Z + jet final states. 



We look at three measurements of the Z p±: the Run I measurement by CDF, and two 
Run II measurements by DO, in the electron and muon channels. 

CDF Run I result The Z p± analysis from CDF Run I [7] is a measurement of the cross 
section as a function of the transverse momentum of e + e~ pairs in the Z boson mass region 
of 66 GeV < m ee < 116 GeV from pp collisions at y/s = 1.8 TeV, with the measured lepton 
acceptance extrapolated to Air with no p± cut. The analysis is also subject to ambiguities 
in the experimental Z p± definition [7]. 

Fig. [T] shows that the MC description for both HERWIG + + Z NLO generation in the 
POWHEG formalism and SHERPA Z+3 jets in the PS+ME formalism agree with CDF data, 
inside the experimental uncertainties, except for the low Z p± region for HERWIG + +. 
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Figure 1. Z pj_ analysis for Tevatron Run I. The yellow band in the ratio plot correspond to the 
uncertainty of the data. 



DO Run II results The DO Z (— > e + e~) p± analysis [8] is based on pp collisions at y/s 
= 1.96 TeV, with a looser lepton pair mass cut of 40 < m ee < 200 GeV. The measured 
differential spectrum is normalized to the total Z cross section, to reduce overall systematic 
uncertainties. The electrons were measured in \r]\ < 1.1 or 1.5 < \rj\ < 2.5, with p± > 25 
GeV. The result was extrapolated to 4ir with no p± cut. 

We can see in Fig. [2] (left) that the DO analysis in the electron Z decay channel has a 
systematic behaviour in the region of medium Z p±, for both HERWIG + + Z NLO (POWHEG 
formalism), HERWIG + + Z LO with ME formalism and SHERPA Z + 3 jets. The NLO 
plot shows slightly better behaviour in the mid region (30-60 GeV) than the LO with ME 
corrections, but no significant differences in the low region. The high p± region lacks the 
statistics to make a detailed comparison. 

The middle region suggests a problem either with the the Monte Carlo or something 
in the analysis. We therefore looked also at a new DO Z p± analysis [9] in the muon 
channel. The muons were measured in \r]\ < 1.7 and p± > 15 GeV. This new analysis has 
as an important development: the definition of the final observable is made at the level 
of particles entering the detector, while previous measurements have applied theoretical 
factors correcting for any final state radiation and for the measured lepton acceptance to 
full 4-7T coverage. This new approach minimises the dependence on theoretical models, and 
therefore any biases in comparisons. The differential cross section is normalized to the 
total Z cross section, to reduce overall systematic uncertainties, as in the electron channel 
analysis [8]. 



We can see in Fig. [2] (right) that in the muon channel the discrepancy in the medium Z 
transverse momentum region is gone, and the ratio of MC to data does not show significant 
systematic discrepancies for both generators. So the Monte Carlo can better reproduce 
the analysis with the more limited lepton acceptance and without the model dependent 
corrections. For HERWIG + + LO without ME corrections, the expected behaviour is to 
produce fewer events in the high transverse momentum region, as this is not fully populated 
by the PS formalism alone. 
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Figure 2. Comparison of boson transverse momentum for Z production at: NLO HERWIG + + 
(POWHEG formalism), LO HERWIG + + (ME corrections on and off) and LO SHERPA Z + 3 jets, in 
the Z — > e + e~ channel (left) and Z — > n + n~ channel (right). Both plots are scaled by 1 over the 
total cross section, to reduce systematic uncertainties. 



4.2.1 PDF and K PERP choice on SHERPA 

For the sake of using the best parameters for the description of Tevatron data, some other 
parameters were studied in the SHERPA generator: the impact of the use of different PDFs 
and different values for intrinsic transverse momentum of the partons within the hadrons 
(K_PERP_MEAN) and its gaussian width (K_PERP_SIGMA). 

The same Z (— > /x + /i _ ) p± analysis [9] of the previous section is used, for the reasons 
discussed above. 

Intrinsic Transverse Momentum of Partons Within the Hadrons In Fig. [3] (left) 
we show the transverse momentum of the Z boson in Z+3 jets production with different 
values for the transverse momentum of the partons within the hadrons. We can see that 
the K_PERP_MEAN = 1.4 (the default value is 0.8) and K_PERP_SIGMA = 0.8 (default value) 
shows a better agreement in the lowest Z p± bin. 

PDF Set With Different 0:5 Values In order to show the impact of changing the 
PDF on the shape of the Z p±, the analysis was run with the MSTW08 [10] set, that is a 
set of PDFs fitted with different values of the strong coupling constant in the Z mass pole, 
as(Mz) = 0.11, 0.13. Fig. [3] shows the change introduced by varying as- These were 
compared to two other PDFs: the default in SHERPA, CTEQ6.6, and to CTEQ6L1. The last 
provides a better description of the Z p± data, especially in the low transverse momentum 
region. For HERWIG + + with POWHEG, a NLO PDF is necessary, and all the simulations 
used the default one, MRST'02 NLO [11]. 
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Figure 3. The Z p± in SHERPA for several different parameters of K_PERP and its gaussian width 
(left), using CTEQ6L1 PDF, and for different PDFs (right), using default values of K_PERP (0.8) and 
its width (0.8), equivalent to the blue curve on the left graphic. In this plots, no MPI model was 
simulated. The plots are scaled by 1 over the total cross section, to reduce systematic uncertainties. 



4.3 Multiple Parton Interactions 

With both generators providing a reasonable description of the Z p±, we can study the 
rest of the event in more detail. However, before looking at the jets recoiling against the 
Z, we must first constrain the other source of hadronic activity, MPI. 

4.3.1 Constraining UE/MPI 

To study the model of the MPI in each of the generators used, the CDF underlying event 
analysis [12] was compared to both the HERWIG + + and SHERPA generators. 

The analysis was made for Drell-Yan events with Z/7* —> e + e~ and Z/j* — > 
A mass cut 70 < mu < 110 GeV was applied at the generator level. The analysis is 
based on the observation that the hard interaction in an event typically falls along an axis, 
and activity from MPI is completely uncorrelated with this axis. Each event is therefore 
decomposed into regions in the azimuthal angle, 0. The "toward" region defined by the 
direction of the Z, which is used to set 4> — 0. The opposite direction, the "away" region, 
is then dominated by the recoil to the Z. The "transverse" regions, defined by 60 < <f> < 
120, generally have little activity from the hard interaction, and so are most sensitive to 
MPI and the underlying event. In the analysis, the transverse region is defined for \rj\ < 1 
[12]. 

Fig. [4] shows the comparison of HERWIG + + to the CDF data with the MPI model 
turned on and off. There appears to be a disagreement in the observed UE, with HERWIG + + 
always producing less activity than the data in the transverse region. The default settings 
for the MPI model in HERWIG + + were tuned to provide the best fit to the jet data from 
Run I and Run II. Adjusting some of these parameters does not yield a significant im- 
provement in the description of the Z data, however [3]. 

For the SHERPA generator, comparisons are made with the default settings for MPI 
[13]. Changing the PDF to CTEQ6L1, which provided the best description of the Z p±, 
significantly degrades the MPI model performance, most probably due to the different 
value of as {M-z) an d running of a s between the two PDF sets. The most important 
parameter in tuning the MPI model is the scale of the transverse momentum cutoff, and 
three values are tested with the CTEQ6L1 PDF: 2.1, 2.3 and 2.5 GeV. The plots in Fig. [5] 
show that the best settings for SHERPA are the CTEQ6L1 PDF with the scale set to 2.5 GeV. 
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Figure 4. The transverse region charged particle density (left) and the transverse region charged 
p sum density (right) in Underlying Event analysis for HERWIG + + Z NLO with MPI on and off. 
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Figure 5. The transverse region charged particle density (left) and the transverse region charged 
psum density (right) in Underlying Event analysis for SHERPA Z + 3 jets with MPI on, and different 
PDFs: CTEQ6.6 with standard MPI scale tuning or CTEQ6L1 with scale parameter equals to 2.3 and 
2.5 GeV. It also shows the performance when the MPI is turned off. 



4.4 Z + jets - HERWIG + + and SHERPA LO vs POWHEG NLO 

We now return to our main aim: to assess the impact of NLO matrix elements in the 
simulation of Z (+ jets) events. We use the following generator configurations: LO without 
ME correction, LO with ME correction and NLO (POWHEG formalism), for the HERWIG + + 
generator; and Z + 3 jets at LO for the SHERPA generator. 

Table [TJ shows a comparison of the total Z — > e + e~ cross section measured at the 
Tevatron CDF experiment [14]. As expected, the NLO simulation has the best prediction 
of the cross section, while at LO, HERWIG + + has a slightly better prediction than SHERPA. 
Further comparisons between the ME+PS merging in HERWIG + + and SHERPA (besides 
ALPGEN and PYTHIA) can be found in reference [15]. 

Z boson rapidity Fig. [6]shows the DO measurement of the Z — > e + e~ cross section as a 
function of the Z boson rapidity [16]. All generators describe the data well. The differential 
cross section is also normalized to the total cross section for Z — > e + e~ production. 
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Table 1. The total cross sections for Z— > e + e production in data, SHERPA and HERWIG + + Monte 
Carlo generators. The parameters for HERWIG + + are the default, with MPI simulation. For 
SHERPA, PDF CTEQ6L1, MPI with scale 2.5 GeV, and optmized K_PERP parameters. 
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Figure 6. Comparison plots for Z production at: LO (ME correction off), LO (ME correction on) 
and NLO for HERWIG + +, and Z+3 jets LO for SHERPA: the cross section as a function of the 
Z boson rapidty. The plot is normalized to its integral. The parameters for HERWIG + + are the 
defaults, with MPI simulation. For SHERPA, PDF CTEQ6L1, MPI with scale 2.5 GeV, and optmized 
K_PERP parameters. 



Jet multiplicity In Fig. [7] (left) the CDF Z — >■ e + e~ [10] analysis shows the cross 
section as a function of jet multiplicity. The jets are required to have p± > 30 GeV and to 
be within the rapidity range, \yj e t\ < 2.1. The plot is then normalized to the cross section 
of production of Z+l jet (first bin) in data. This is useful to test the relative fraction of 
two and three jet events. We can see that the SHERPA Z+3 jets generator can describe, 
inside data uncertainties, the expected number of events with two and three jets, while 
HERWIG + + fails for jet multiplicities higher than one. 

The DO Z —7- e + e~ analysis [17] measures the n-jet cross section ratios and is shown in 
Fig. [7] (right). The distribution is inclusive and normalized to the total Z — > e + e~ data 
cross section (i.e., the bin of zero or more jets). Here we can see that the HERWIG + + Z 
NLO , HERWIG + + LO with ME corrections and SHERPA Z + 3 jets predictions describe the 
first jet bin well. For higher jet multiplicities, the SHERPA Z + 3 jets generator describes 
the data better, however it fails for four jets, as one may expect. HERWIG + + LO without 
ME corrections does not describe the data in any bins. 



Jet transverse momentum The DO analysis in [18] measures the differential cross 
section as a function of the transverse momentum, of the three leading jets in the production 
of Z/j* —7- e + e _ + jets + X, normalized to the total cross section of Z/7* — > e + e~ 
production. Fig. [8]shows comparisons with the HERWIG + + generator at NLO (POWHEG 
formalism) and at LO with and without ME corrections, and with SHERPA Z + 3 jets. 
The LO without ME corrections shows the expected failure of the parton shower alone 



Jet multiplicity 



Inclusive jet multiplicity 



14 
1.2 
1 

0.8 
0.6 
0.4 
0.2 



-+- 



CDF data 
HERWIG++ Z NLO 
HERWIG++ Z LO ME on 
HERWIG++ Z LO ME off 
SHERPA Z+3 jets 



Al 
+ 



0.8 
0.6 

04 
0.2 



DO data 

HERWIG++ Z NLO 

HERWIG++ Z LO ME 

HERWIG++ Z LO 

SHERPA Z+3 jets 



«jet «jet 

Figure 7. Comparison plots for Z production at: LO (ME correction off), LO (ME correction on) 
and NLO, on HERWIG + +, and Z+3 jets LO on SHERPA. In the left, the cross section prediction 
as a function of jet multiplicities, normalized to the first bin (the data cross section for one jet 
in the event). The right plot shows the ratios of the expected ro-jets cross sections to the Z 
total production cross section (first bin). The parameters for HERWIG + + are the defaults, with 
MPI simulation. For SHERPA, PDF CTEQ6L1, MPI with scale 2.5 GeV, and optimized K_PERP 
parameters. 



to populate the high p± region, because it corresponds to the phase space where the PS 
can't fill properly. When the ME correction is turned on, the effect in correcting the high 
transverse momentum region can be seen. The behaviour for the NLO plot is more close 
to the data in the higher p± region (above 130 GeV), while it has similar results to LO 
with ME corrections at lower p±. However, for the second and third leading jets, the 
description is also not good - NLO Z production includes the LO matrix element for one 
jet production, but the second and third jets are produced only by the parton shower, 
which again underestimates the data. For the SHERPA Z + 3 jets, the matrix element for 
the further jet emissions allows for a description similar to the NLO curve for the leading 
jet, and does a better job for the second and third leading jets. 

DO Z — > Analysis To further study the jet recoil and Z boson p±, the DO 

analysis with Z (— > /i + + jets [19] was used. It measures the cross sections as a function 
of the boson momentum, and as a function of momentum and rapidity of the leading jet. 
It is required at least one jet in the event, with p± > 20 GeV and \rj\ < 2.8. This analysis 
doesn't normalize the data by the Z total cross section production - the shape of the plots 
are normalized by their integral. 

As can be seen in Fig. |9j the cross section in leading jet rapidity is well behaved inside 
the uncertainties, however HERWIG + + Z LO without ME corrections tends to produce 
a wider distribution. The Z p± shows imprecisions in the low momentum region: for 
HERWIG + + Z NLO the production in low Z p± is around 40% lower than data, and in 
SHERPA it's greater than data. This region is particularly sensitive to jets produced by 
MPI, which are not recoiling against the Z boson. 

The leading jet p± does not show any significant discrepancy in both cases. However, 
a behaviour that one can see is the deficit of events in the Monte Carlo compared to the 
data in the range of 50 < p± < 120 GeV. The Z pj_ shows no such deficit (see Fig. [2]), 
suggesting the MC is not fully describing the hadronic recoil of the Z, and this is studied 
in detail in the appendix. 

HERWIG + + LO with ME corrections show similar behaviour to the NLO prediction, 
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Figure 8. Comparison plots for Z production at: LO (ME correction off), LO (ME correction on) 
and NLO for HERWIG + +, and Z+3 jets LO for SHERPA. The transverse momentum of the leading, 
second and third leading jet respectively. All the plots are normalized to their integrals. The 
parameters for HERWIG + + are the defaults, with MPI simulation. For SHERPA, PDF CTEQ6L1, 
MPI with scale 2.5 GeV, and optmized K_PERP parameters. 



and without the ME corrections has a completely different shape in all plots, due to the 
previously discussed issues in populating the high p± regions. 



5. LHC ANALYSES CUTS 

After choosing the Monte Carlo parameters for the HERWIG + + Z NLO (in POWHEG for- 
malism) and SHERPA Z + 3 jets (LO), comparison plots for Z — > e + e~ in the LHC energy 
(first Run - 7 TeV) were performed, using the following kinematic cuts (taken from [20]): 



• Transverse momentum of the lepton p±(l) > 15 GeV; 

• Absolute value of the lepton pseudorapidity \rj(l)\ < 2.4; 

• Transverse momentum of the jet p±(j) > 20 GeV; 

• Absolute value of the jet pseudorapidity \rj(j)\ < 4.5; 

• Lepton isolation criteria: ARu > 0.2; AMu > 0.4; 
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Figure 9. Comparison plots for Z production at LO and NLO for HERWIG + + and LO for SHERPA 
Z+3 jets, in the muon channel: upper left, the Z p±, upper right, the leading jet rapidity, and 
lower, the leading jet p±. All plots are normalized to their integrals. For both generators MPI is 
included. The analysis asks for at least one jet with p± > 20 GeV and \r)\ < 2.8. HERWIG + + has 
the default parameters. The SHERPA PDF is CTEQ6L1, the MPI scale parameter is 2.5 GeV, and 
optmized values of K_PERP are used. 



The jets are reconstructed with the antikT clustering algorithm, with cone radius R=0.7. 
A mass cut on the leptons invariant mass 60 < Mu < 110 GeV was applied. The events 
generated correspond to a integrated luminosity of 1 

the transverse momentum 



The kinematic variables plotted are shown in Figs. 10 to 15 



of the first, second and third leading jets, and the inclusive p± for 1 and 2 or more jets in the 
event; the transverse momentum of the two leptons used to reconstruct the Z boson, and 
of the Z; The pseudorapidity of Z and jets; the invariant mass of the Z; the jet multiplicity 
in the event. All the plots are normalized to their integral, except the jet multiplicity, that 
is the cross section of production of the event with n-jets. 



The Z p± (Fig. 10 left) shows a discrepancy in the low transverse momentum region: 
SHERPA simulates fewer events in this region, compared to HERWIG + +. However, for the 
leading electrons p± of the event (Fig. 10 center and right), the discrepancy isn't as much 
as for the boson p±. 

For the leading jet p± (Fig. 11 left), the behaviour is also discrepant - the SHERPA 
generator simulates more events in the low transverse momentum region, and fewer events 
in medium and high transverse momentum, although in the later ones it agrees with 



HERWIG + + inside the statistical uncertainties. For the second and third leading jets p± 



(Fig. 11 center and right), there is good agreement inside MC uncertainties. 

The jet p± for Nj e f > 1 and Nj e t > 2, that is the transverse momentum of all jets in 
the event that pass the cuts p± > 30 GeV and \rj\ < 2.1 for events with at least one or 



two jets (Fig. 12 left and center, respectively) agree in both generators inside the errors, 



as well as the invariant mass of the Z boson (Fig. 12 right). 



The pseudorapidity for the Z boson and leading electrons (Fig. 13) agree for the 
generators, inside the errors. However, for the leading jets, the behaviour is different: 
while for the leading jet (Fig. [l4]left) the region in which |?7| < 3 is agreed, the region in 
the range 3 < \r]\ < 5 shows fewer events for the HERWIG + + generator than for SHERPA. 



For the second leading jet (Fig. 14 center), SHERPA has more events in the central region 
and fewer events in the range 3 < \rj\ < 5. For the third leading jet (Fig. 14 right), due 
to high statistical errors, both generators agree in full range. In the jet inclusive rj for 

\rj\ < 5 shows more events for SHERPA generator, 
center) the generators agree inside errors. 



Njet > 1 (Fig. [15] left) the range of 3 < 
while for N jet > 2 (Fig. ~ 



15 



For the cross section as a function of jet multiplicity (Fig. 15 right), the simulation of 
Z production at NLO (POWHEG formalism) has a greater cross section than SHERPA Z+3 
jets at LO for the production of one and two jets. However, because SHERPA takes into 
account matrix elements up to 3 jets, the description for higher jet multiplicities have a 
greater cross section for the SHERPA generator. The data will usually have greater cross 
sections than predicted with the generators, because for having equal cross sections one 
should simulate up to infinite orders of QCD. 
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Figure 10. Comparison plots for LHC (7 TeV) energy, for Z production in electron channel: the 
Z p± (left), leading electron p± (center) and 2nd leading electron p± (right), for HERWIG + + Z 
NLO and SHERPA Z+3 jets, both with MPI simulation, and SHERPA with optimized K_PERP and 
CTEQ6L1 PDF, scale parameter 2.5 GeV. 
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Figure 11. Comparison plots for LHC (7 TeV) energy, for Z production in electron channel: the 
leading (left), second (center) and third (right) leading jet p ±1 for HERWIG + + Z NLO and SHERPA 
Z+3 jets, both with MPI simulation, and SHERPA with optimized K_PERP and CTEQ6L1 PDF, scale 
parameter 2.5 GeV. 
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Figure 12. Comparison plots for LHC (7 TeV) energy, for Z production in electron channel: the 
jet p± for inclusive Nj e t > 1 (left) , jet p± for inclusive Nj et > 2 (center) and Z invariant mass 
(right), for HERWIG + + Z NLO and SHERPA Z+3 jets, both with MPI simulation, and SHERPA with 
optimized K_PERP and CTEQ6L1 PDF, scale parameter 2.5 GeV. 
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Figure 13. Comparison plots for LHC (7 TeV) energy, for Z production in electron channel: the Z 
T] (left), leading electron 77 (center) and 2nd leading electron 77 (right), for HERWIG + + Z NLO and 
SHERPA Z+3 jets, both with MPI simulation, and SHERPA with optimized K_PERP and CTEQ6L1 
PDF, scale parameter 2.5 GeV. 
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Figure 14. Comparison plots for LHC (7 TeV) energy, for Z production in electron channel: the 
leading (left), second (center) and third (right) leading jet 77, for HERWIG + + Z NLO and SHERPA 
Z+3 jets, both with MPI simulation, and SHERPA with optimized K_PERP and CTEQ6L1 PDF, scale 
parameter 2.5 GeV. 
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Figure 15. Comparison plots for LHC (7 TeV) energy, for Z production in electron channel: the jet 
77 for inclusive Nj et > 1 (left), jet 77 for inclusive Nj et > 2 (center) and jet multiplicity (right), for 
HERWIG + + Z NLO and SHERPA Z+3 jets, both with MPI simulation, and SHERPA with optimized 
K_PERP and CTEQ6L1 PDF, scale parameter 2.5 GeV. 



6. CONCLUSIONS 

We studied the effects of adding a next to leading order term in the Monte Carlo simulation 
for Z production in hadron colliders. In addition, the influence of adding a matrix element 
correction in the leading order calculations for improving the showering in the parton 
shower formalism, and the influence of some theoretical parameters that enter as input in 
the Monte Carlo programs. 

We could see that the use of the NLO term, studied here in the PDWHEG formalism as 
implemented in the HERWIG + + generator, improves the prediction of the cross sections of 



the processes, as well as the behaviour of the physics observables, specially in the region 
of higher transverse momentum. The implementation of the matrix element correction 
in the parton shower formalism also improves the description of the data in the region 
of high transverse momentum, compared to the calculations in leading order without the 
correction. 

It was also seen that both SHERPA and HERWIG + + generators show a systematic be- 
haviour lower than the Tevatron data in the region of mid-range transverse momentum 
of the Z boson. The DO analysis performed in the muon channel has a better Monte 
Carlo description than the one performed in the electron channel, and doesn't use any 
kind of extrapolation based in theory. According to the new muon analysis, if the same 
theory based corrections are applied, the muon data agree with the electron channel data. 
So, these theoretical dependent corrections could be responsible for the disagreement and 
the systematic behaviour between the Monte Carlo and the Tevatron data in mid range 
transverse momentum of the Z. 

The underlying event analysis showed that it is possible to choose a good tune for the 
parameters in the multiple parton interactions model (Amisic) for the SHERPA generator, 
using the PDF that best describes the Z p± data (CTEQ6L1). However, for the HERWIG + + 
generator, there was no parameter selection that could describe well the underlying event 
data, and the setup used was the standard one, based in the best description of other 
physics observables by the authors. 

In the analysis of the balance of the Z p± against the leading jet p± and the sum of 
p± of all jets in the event was possible to see the effects that the MPI model have in the 
region of low transverse momentum. This is done in the appendix. 

The analysis for LHC first run energy (7 TeV) shows that some kinematic quantities 
have a different prediction from HERWIG + + Z NLO when compared to SHERPA Z+3 jets 
simulation, such as the low region of the Z boson transverse momentum and leading jet p± 
and leading jets pseudorapidity in the high absolute value of r] range. For the cross section 
of jet production, the HERWIG + + Z NLO generator predicts a higher cross section for the 
first and second jets, while the SHERPA Z+3 jets show higher cross sections for higher jet 
multiplicities. A full comparison once LHC has enough luminosity will show some features 
that the Monte Carlo generators will have to be able to deal with, implementing in their 
processes new information that the new data will provide. 
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Appendix 

Jet Recoil Because primarily the Z pj_ should be balanced in the event with the pj_ of 
the leading jet (or all jets in the event), the study of the shape of the jet p± should give 
some information about the systematic difference in the description of the Z — > e + e~ p±, 
and the leading jet p± in Z events. With the data, it is impossible to isolate the different 
effects of MPI and hard scatters, or extend below the p± cuts used in analysis. So, we 
perform some studies to check the pj_ balance in the MC samples, to see if the Z pj_ was 
summing up to zero when added to the p± of the leading jet, the sum of the p± of all jets 
and, for a more fundamental check of balance, with the sum of the p± of all particles found 
in the event. 



The first sanity check was done to assure that the sum of the transverse momentum of 
all particles in the event was balanced, so that both MC generators and RIVET were dealing 
with the events correctly. After this check, the Z p± balance against the sum of the p± 
of all jets in the event and leading jet was analyzed, for both SHERPA and HERWIG + + 
generators, with multiple parton interactions (MPI) turned on and off. 

For the study of the balance of the leading jet, the Tevatron analysis cuts are used 
(p_I_(jet) > 20 GeV and \rj\ < 2.1), so it can be comparable to the jet p± and Z p± spectra. 
In the other hand, very loose cuts (p_i_(jet) > 5 GeV and \rj\ < 10) were applied to analyse 
the balance of the Z p± against the sum of all jets in the events, so it's possible to truly 
test the balance between jets and Z p± at generator level. 



Fig. 16 shows that with the MPI model turned on, the SHERPA and HERWIG + + genera- 
tors have different behaviour for low Z p± region: SHERPA shows more events concentrated 
in the region of {Z p± - jets p±) = -17 for the leading jet and {Z p± - jets p±) in the range 
of [-5, 0] for the sum of jets, while HERWIG + + shows a behaviour that is very similar to 
that without the MPI model, in Fig. 17, upper plots. 



Fig. 17 shows that for the MPI model off in the generators, the behaviour is similar. 



However in Fig. 16 there is more p± in the jets, so this isn't balancing exactly all the 



transverse momentum that comes from the Z boson. Specially in the low region of the 
boson p±, we are probably seeing additional particles from MPI that don't contribute to 
the Z p± but do contribute to the jet p±. 
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Figure 16. Difference between Z p± and: sum of jets p± (left) and leading jet p± (right), as a 
function of bins of Z p± for HERWIG + + (upper) and SHERPA (lower), MPI turned on in default 
for HERWIG + + and with CTEQ6L1 PDF, scale of 2.5 GeV and optmized K_PERP in SHERPA. The 
cuts on the analysis are, for the leading jet, the same as Tevatron analysis: p±{jet) > 20 GeV and 
\t]\ < 2.1, and for the sum of jets, p±(jet) > 5 GeV and |ry| < 10. At least one jet is required. 



For a better comparison of the plots, we also make one-dimensional projections of the 
plots for three different Z p± regions {Z p±< 30 GeV, 30 GeV < Z p± < 100 GeV, Z p± 
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Figure 17. Difference between Z p± and: sum of jets p± (left) and leading jet p± (right), as a 
function of bins of Z pj_ for HERWIG + + (upper) and SHERPA (lower), MPI turned off. HERWIG + + 
has default PDF adjustment, and SHERPA has CTEQ6L1 PDF and optimized K_PERP. The cuts 
on the analysis are, for the leading jet, the same as Tevatron analysis: p±(jet) > 20 GeV and 
< 2.1, and for the sum of jets, p±(jet) > 5 GeV and \r]\ < 10. At least one jet is required. 



> 100 GeV). These plots (Figs. 18 and 19) make more evident the difference that one can 
see, specially in the lower p± region, between the generators with MPI model turned on. 
SHERPA and HERWIG + + disagree from each other in the balance for both the sum of all 
jets p± and leading jet p±. However, they agree reasonably when there is no simulation of 
multiple parton interactions. This makes more evident that these models should be better 
studied and implemented in the context of the Drell Yan production, as have already been 
shown concerning the HERWIG + + generator in the UE analysis. 
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Figure 18. The difference between Z p± and the sum of all jets p± for three regions of boson p±: 
Z p ± < 30 GeV, 30 GeV < Z p ± < 100 GeV, Z p ± > 100 GeV. The cuts on the analysis are 
P±(jet) > 5 GeV and |?7| < 10. Generator parameters are the same as in Figs. 16 and 17 



Leading Jet Rapidity for jet p± > 5 GeV The MPI model in each simulation is 
clearly producing additional jets in Z events, and we may expect these MPI jets to have a 
different angular distribution to jets from the hard process. So finally we check the rapidity 
distribution of all jets with p± > 5 GeV. It can be seen that, when the MPI are turned on 
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Figure 19. The difference between Z p± and the leading jet p± for three regions of boson p±: Z 
p±< 30 GeV, 30 GeV < Z p ± < 100 GeV, Z p ± > 100 GeV. The cuts on the analysis are the same 
as Tevatron analysis: p±(jet) > 20 GeV and |7j| < 2.1. Generator parameters are the same as in 
Figs. fT6landfT7l 



in the simulation, the shape is different in the central region: around the pseudorapidity 
zero, we have a low bump, when compared to the plots without MPI simulation (Fig. 20 
left). 

The behaviour is almost gone when the cut on the jets is that of the analysis (Fig. 
20 right) - the MPI model affects specially the production of particles in low transverse 
momentum region. Once more, the MPI model in the generators disagree between them- 
selves. While such a low p± region is experimentally difficult or impossible to access, it is 
clearly a very sensitive region for MPI tuning and data with p± cuts as low as possible are 
very interesting. 
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Figure 20. Leading jet rapidity for jet cuts on tranverse momentum of 5 GeV (left), and analysis 
cut of 20 GeV (right). HERWIG + + has default parameters, SHERPA has CTEQ6L1 PDF, optimized 
K PERP, and when MPI is on. scale of 2.5 GeV. 
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